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ABSTRACT 

PROBLEM TO BE SOLVED: To provide a joint body research support system which 
ran improve the reliability of a result body to be delivered to an electric 
rower company, reduce operation for entry into a joint facility research 
table by a field operator making a research into joint bodies, and 
.-:,or:en the research time. 



SOLUTION: An area power distribution block database is generated by 
inputting joint body information based upon the contents of a contract with 
a requester for the installation of joint bodies by area power distribution 
blocks as objects of research and the joint body facility research 
table is generated which is used to make a previous research of joint 
body articles, inspect differences from an actual state after the field 
research is completed, totalize information of contents different from 
joint body information based upon the contract with the mentioned 
requester, and generate the result body by inputting information generated 
ov ;t.:cging a joint cable of a joint body found by the field research by a 
. nv mica t ion cable external -diameter measurement system and entering all 
r r s a r c h results. 



COPYRIGHT: (C) 2002, JPO 
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0!)-0540S3 [JP 5054083 A] 
March 05, 1993 (19930305) 
I CHI YAM A TOSH IHARU 

NEC CORP [000423] (A Japanese Company or Corporation), JP 
{ Japan ) 

03-235452 [JP 91235452] 
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4 5.4 (INFORMATION PROCESSING — Computer Applications); 4 5.2 
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-Sect-ion:— P— Section - No-; -15-71 - Vol— Iff — Not -3 6"6"7 ~P*g . — l"4y July 
09, 1993 (19930709) 



ABSTRACT 

i' i'h i-'OSE : To automatically extract the joint information between tables 
without: any leaking even in the database composed of many tables by 
retrieving the database only by describing the conditions concerning the 
column of respective tables even when a user does not have the knowledge 
concerning the structure of the database which is a retrieval object and 
the detailed knowlledge concerning the joint information between the 
tables . 



CONSTITUTION: This system has a joint information holding means 4 to hold 



r ht- joint information between tables of a database , a joint conditions 
imperting means 1 to obtain the joint information from the joint 
information holding means 4 and output a retrieval expression to add the 

joint information with the table retrieval expression as an input and 
a database control means 2 to perform the retrieval of a database 
storage means 3 while the retrieval expression is received, and has a joint 
information extraction means 5 to extract the joint inf ormaiton between the 

tables of the database storage means 3 from the database control 
means 2 and send the extracted joint information to the joint information 
holding means 4 . 
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ABSTRACT 

PURPOSE: To realize a high-speed processing system by scanning 
simultaneously relation to be joined together, comparing to decide 

■ t\ join field value is larger, smaller, or is equal, etc., to 

■ : : :■ ioi.r.inq, and to produce the relation of result. 

\." " ; T r 'T i ON : A relational data base processing mechanism 1 accepts an 
. r. :.-y from a user and passes it to a relational operation control part 2. 
The relational operation control part 2 checks the adequacy of the inquiry. 
An execution control part 6 sorts and extracts relation operation to 
determine its execution procedure, which is passed to an interpretation 
execution part 3 and executed. Then, an outer joining execution part 10 
performs outer joining processing. In outer join operation processing, the 
values of joint field are extracted and compared successively from the 
starting record of plural sort relation to generate outer joining result 
responding to the result decided to be larger, smaller or is equal, and the 
result is inserted into the relation of results. 



~5/57"4 (rtem"~l"~frbin~friel' "350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent . All rts. reserv. 

015892836 **Image available** 

WPI Acc No: 2004-050671/200405 

XRPX Acc No: N04-040933 

Query optimization method in relational database system, involves 
joining pair of tables having respective primary and foreign key 
columns using specified join condition, and generating derived date 
constraint rule 

iv s r«nr Assignee: NCR CORP (NATC ) 

*:.v^n: or: GHAZAL A S; SINCLAIR P L 



Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6662175 Bl 20031209 US 2001850765 A 20010508 200405 B 

Priority Applications (No Type Date): US 2001850765 A 20010508 

Pa teni: De t a i 1 s : 

i-'i-^ip. No Kind Lan Pg Main IPC Filing Notes 

'.C6rr 7 b Bl 15 G06F-017/30 

Abstract (Basic): US 6662175 Bl 

NOVELTY - A pair of tables having respective primary key (PK) and 
foreign key (FK) columns are joined , using PK is equal to FK as 
join condition. An initial running constraint (RC) having null range is 
created. A derived date constraint rule (DDCR) is generated, based on 
correlated value columns of join result. The RC is modified by merging 
range of RC with range of new constraint that is computed for each row 
in j oin result . 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
computer program for optimizing queries to database . 

USE - For optimizing queries in relational database system such 
as teradata active data warehousing system and other database systems 
r.avi. ng massively parallel processing (MPP) architecture or symmetric 
iiiu.I. L iprocessing (SMP) architecture. 

ADVANTAGE - Automatically derives constraint rules for correlated 
variables, and reduces the size of intermediate result. 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart of 
algorithm for deriving and applying derived date constraint rule. 

pp; 15 DwgNo 4/9 

Title Terms: QUERY; OPTIMUM; METHOD; RELATED; DATABASE ; SYSTEM; JOIN; 

PAIR; TABLE ; RESPECTIVE; PRIMARY; FOREIGN; KEY; COLUMN; SPECIFIED; JOIN 
; CONDITION; GENERATE; DERIVATIVE; DATE; CONSTRAIN; RULE 

Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 

; : " I. .; o Segment : EPI 
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015834435 **Image available** 
WPI Acc No: 2003-896639/200382 

Related WPI Acc No: 2001-496729; 2003-660928; 2003-897748 

XRPX Acc No: N03-715546 

Data tubes populate method for data management system, involves joining 
columns or domains of table with dimensions and other relations mapped 
into hypercube to populated hypercube 

Parent Assignee: DECODE GENETICS EHF (DECO-N) 

i" nvfjntor : EGILSSON A S; GU.DB JARTSSON H 

Nij::uo.r of Countries: 001 Number of Patents: 001 

\ a or: c Pa mi 1 y : 

Patent No Kind Date Applicat No Kind Date Week 

US 20030023608 Al 20030130 US 99475436 A 19991230 200382 B 

US 2002216670 A 20020808 



Priority Applications (No Type Date): US 2002216670 A 20020808; US 99475436 

A 19991230 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 20030023608 Al 27 G06F-007/00 CIP of application US 99475436 

CIP of patent US 6434557 

A:-.!o-;::: (Basic): US 20030023608 Al 

NOVELTY - The method involves representing a calculated relation 
as a table supported by columns or domains and joining columns 
or domains of the table with dimensions and other relations mapped 
into a hypercube. The new relations are created from existing relations 



and table -like representations of calculated relations. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 

'iara i; ube populace system. 

L'^E - Tor data management system. 

ADVANTAGE - Eliminates ambiguities observed in combined 
mGdsuremenis used to populate a hypercube. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of the 
data tube populate system. 

data tube populate system (100) 
database (101) 

online analytic processing server (102) 

online analytic processing client (103) 

network (104) 

pp; 27 DwgNo 1/14 

Title Terms: DATA; TUBE; METHOD; DATA; MANAGEMENT; SYSTEM; JOIN; COLUMN; 

DOMAIN; TABLE ; DIMENSION; RELATED; MAP; POPULATION 
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Safety busbar system, especially for a tablet production machine 
comprises system elements which are jointly capable of simultaneous 
implementation of safety, control and measurement functions 

•'a tent Assignee: KORSCH AG (KORS-N) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

DE 20304950 Ul 20030528 DE 2003U2004950 U 20030321 200344 B 

Priority Applications (No Type Date): DE 2003U2004950 U 20030321 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
DE 20304950 Ul 14 G06F-013/38 

AL-s ' :\i c: t (Basic): DE 20304950 Ul 

NOVELTY - Safety busbar system (10), especially for a tablet 
: r '.u...-\ ion machine (12) comprises at least one module (14) compatible 
with the busbar system and/or at least one busbar controller (16) and 
c;t least one busbar conductor (18). The system ensures essentially 
simultaneous implementation of at least one safety function, at least 
one control function and/or at least one measurement function. 

USE - Used as a safety system for a tablet production machine. 

ADVANTAGE - The amount of wiring and the general system complexity 
are reduced together with fault hazards. 

DESCRIPTION OF DRAWING (S) - The drawing shows a schematic view of 
the proposed safety busbar system. 
. - - -Tablet — pro d-uet-i-on- machine- -(-1-2-) — — ^ — " 

Module (14) 

Busbar controller (16) 
Busbar conductor (18) 
Actuator (20) 
Sensor (22) 

Emergency disconnector switch (24) 
pp; 14 DwgNo 1/1 

Title Terms: SAFETY; SYSTEM; TABLET ; PRODUCE; MACHINE; COMPRISE; SYSTEM; 

ELEMENT; JOINT; CAPABLE; SIMULTANEOUS; IMPLEMENT; SAFETY; CONTROL; 

MEASURE; FUNCTION 
Derwent Class: B07; X12; X25 

International Patent Class (Main) : G06F-013/38 



File Segment: CPI; EPI 
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Document mining method involves removing instance of dirty text within 
document to produce clean document and then performing data mining 
operation 

: Assjqnee: CASTE LLANOS M (CAST-I); STINGER J R (STIN-I) 

.:.-.■•■!. '.or: CASTE LLANOS M; STINGER J R 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030046263 Al 20030306 US 2001944919 A 20010831 200341 B 

Priority Applications (No Type Date) : US 2001944919 A 20010831 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030046263 Al 13 G06F-007/00 

Abstract (Basic): US 20030046263 Al 

NOVELTY - An instance of dirty text within the document, is removed 
to produce a clean document. Data mining operation is then performed on 
the clean document. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) computer system; and 

(2) computer usable recorded medium storing document mining 

program . 

USE - For mining document containing dirty text such as 
typographical errors, misspellings, joined words, ad hoc 
abbreviations, bad grammar, cryptic tables , programming code, core 
riumps, missing or ambiguous punctuation and haphazard capitalization. 

ADVANTAGE - Allows the user to leverage existing domain knowledge 

-?r:c: b!.es easy customization of document containing dirty text 
.v;o i - f A i. nq to the domain and task requirements. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
l he document: mining system. 

pp; 13 DwgNo 2/5 

Title Terms: DOCUMENT; MINE ; METHOD; REMOVE; INSTANCE; DIRT; TEXT; DOCUMENT 

; PRODUCE; CLEAN; DOCUMENT; PERFORMANCE; DATA; MINE; OPERATE 
Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 
File Segment: EPI 
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Replication system for database management in firm, matches data items 
in several master tables using preset data in the tables and 
generates replica table by replicating the matched data items 

Pacenc Assignee: YOKOUCHI H (YOKO-I) 

:nventoir: YOKOUCHI H 

■JjMtber of Countries: 001 Number of Patents: 001 

■ r .v; Fhm'\ I y : 

; :•■.:.■ \ 1 ' Kind Date Applicat No Kind Date Week 

/,:;,-):"/:b Al 20020711 US 2001809257 A 20010316 200267 B 



US 2002101065 A 20020320 

Priority Applications (No Type Date) : JP 2000294551 A 20000927 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

! JS 20020091716 Al 14 G06F-007/00 Cont of application US 2001809257 

Abstract {Basic): US 20020091716 Al 

NOVELTY - The data items in several master tables (108,109) are 
matched using preset data in the master tables as a joining key. 
A replica table (121) is generated by replicating the matched data 
items of the master tables . 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for 
replication program. 

USE - Replication system for database including salary details in 
a firm. 

ADVANTAGE - As the data items of several master tables are 
replicated and included in a single replica table , job execution time 
of the replica system, is reduced, thus efficiency is improved. 

DESCRIPTION OF DRAWING (S) - The figure shows the data flow in the 
>; ^ p 1 i cacion system. 

Master tables (108,109) 

Replica table (121) 

pp; 14 DwgNo 5/7 
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Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 

File Segment: EPI 
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Data replication system for employee management system, matches data 
items of several master tables and replicates matched data items to 
generate replica table using particular data in master table 

Patent Assignee: HITACHI LTD (HITA ); YOKOUCHI H (YOKO-I) 

Inventor: YOKOUCHI H 

Nvunber of Countries: 002 Number of Patents: 002 

■ ' j ' ■ : : L Fa m i. I y : 

■-■ 'nj Kind Date Applicat No Kind Date Week 

■ ,. Al 20020328 US 2001809257 A 20010316 200245 B 

: A 20020412 JP 2000294551 A 20000927 200245 



i r^oruy Applications (No Type Date): JP 2000294551 A 20000927 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20020038315 Al 14 G06F-017/30 

-J-P- -2002 108 6 8-1 : - A 1-2~G0 6'F-Ol 2700 — — ~ 

Abstract (Basic) : US 20020038315 Al 

NOVELTY - A matching unit matches data items of a number of master 
tables (108,109) using particular data in the master tables as a 
joining key to join each other. A replicating unit replicates the 
:TicH.ched data items based on the joining key to generate one replica 
table (.121) . 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
data replication program. 

USE - For replicating data of several master tables of database 
in employee management system. 

ADVANTAGE - The data of number of master tables is replicated in 
one replica table , and hence job execution time is minimized. 



DESCRIPTION OF DRAWING ( S ) - 
data replica system. 

Master tables (108,109) 
Replica table (121) 
pp; 14 DwgNo 1/7 
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The figure shows the block diagram of 
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Elbow rest has joint that is fitted to base, fixing tool that fixes joint 
to base, support that protrudes in front of base when joint is fixed to 
base, and elbow accepting unit that is attached to support 

Patent Assignee: HOGURA Y (HOGU-I) 

Number of Countries: 001 Number of Patents: 001 

Patent: Family: 

Parent No Kind Date Applicat No Kind Date Week 

X ?:Xi205]B62 A 20020219 JP 2000245062 A 20000811 200232 B 

Priority Applications (No Type Date) : JP 2000245062 A 20000811 
Pa tent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2002051862 A 8 A47B-096/18 

Abstract (Basic) : JP 2002051862 A 

NOVELTY - A fixing tool (3) fixes a joint (2) that is fitted to a 
base (1) such as a desk or table . A support articulated by the 
joint protrudes in front of the base when the joint is fixed to the 
base. An elbow accepting unit (5) is attached to the support. 

: 'SE - For e.g. desk, table . 

.-■VANTAGE - Simplifies attachment to or detachment from base. 
: • height co be adjusted. Enables work bench to be reduced in 

DESCRIPTION OF DRAWING (S) - The figure is the perspective diagram 
f ; l t he elbow rest . 
Base (1) 
Joint (2) 
Fixing tool (3) 
Elbow accepting unit (5) 
pp; 8 DwgNo 1/9 
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Relational database systems, using equijoin operations giving a merge 



join process that creates sorted set of outer table rows that satisfy 
selection criteria and enable parallel execution 

Patent Assignee: UNISYS CORP (BURS ) 
inventor: LIU L H 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

PiUMv: No Kind Date Applicat No Kind Date Week 

v ' V M ''VV7 ri 20010206 US 93135312 A 19980731 200165 B 

: : : : :-.y Applications (No Type Date): US 98135312 A 19980731 

; '■ i • <*:.:-. details : 

I'm : . ciriL No Kind Lan Pg Main IPC Filing Notes 
US 6185557 Bl 13 G06F-017/00 



Abstract (Basic) : US 6185557 Bl 

NOVELTY - When performing equijoin operations on two tables , 
as long as outer table join column value is less than or equal 
to last key value, the same data page from inner table will be 
searched repeatedly. Therefore inner table index records will not be 
revisited until outer table join column is greater than last data 
page key value. Combination of next key and last key allows merge join 
process to determine than entire ranges of outer rows do not have 
matching inner rows. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
f o 1 lowi ng : 

(1) A computer readable medium having computer executable modules 
for joining data rows from two tables . 

(2) A merge join process for joining data rows from two tables 
which have a common data column. 

USE - Relational database systems 

ADVANTAGE - Designed to minimize processor time and file input and 
output when performing equijoin operations on two tables . Reduces or 
eliminates searches of the inner index records and the data pages. 
Reduces cache thrashing on the inner index records and so the required 
index record is likely to be in the cache when needed. Minimizing the 

reversal of the index records and data pages on mass storage minimizes 
i he number of operations performed, and therefore provides a more 
efficient search process. Also , because the merge join process is 
structurally suited for execution on the multi-processor computers, the 
speed of the database queries can be increased through parallel 
processing . 

DESCRIPTION OF DRAWING (S) - Merge join process logic flow diagram, 
pp; 13 DwgNo 0/4 
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Database outputs required search data by joining data retrieved from 
tables specified by divided main keys 
Patent Assignee: SONY CORP (SONY ) 
N;j;nber of Countries: 001 Number of Patents: 001 
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r. No Kind Date Applicat No Kind Date Week 
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Al-s:. net (Basic): JP 2001005815 A 

NOVELTY - The main key corresponding to each data is divided 
according to components of the data such that each divided main key 
specifies the table (120A-120C). In order to search data, the main 
key input by operator is divided and data in each table specified by 
the divided main key is joined and output as the required result. 

USE - In database for searching data from tables . 

ADVANTAGE - The search time can be shortened by using the divided 
main key instead of unitary main key for searching purpose. Due to the 
reduction of amount of data and shortening of search time, the workload 
of maintenance operation can also be reduced. 

DESCRIPTION OF DRAWING (S) - The figure shows explanatory drawing of 
database . 

Table ( 120A-120C) 
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Incremental refresh performing method for materialized view in database 
management systems, involves deleting rows, that are attained by 
combining specific rows with changed rows of selected table 

Patent Assignee: ORACLE CORP (ORAC-N) 

Inventor: DIAS K; WITKOWSKI A 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6125360 A 20000926 US 98109115 A 19980702 200107 B 
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Acs tract (Basic) : US 6125360 A 

NOVELTY - The base table of materialized view is established as 
selected table . The row (Tj) which is processed, is combined with 
both changed and unchanged rows of selected table . The rows that are 
attached due to combining of changed rows of selected ' table with Tj 
in materialized view is deleted, and the rows in which Tj combines with 
unchanged rows of selected table is left. 

DETAILED DESCRIPTION - The base table of a materialized view is 

es-ta-bl-i-s-h-ed- —as — a- sel-eoted -table -7— I f -the- selected table — is~~tn~e ~ 

right table of outer join , the materialized view is updated to 
reflect deletions to selected table by processing each row (Tj), that 
combines with changed row of selected table . If Tj combines in 
materialized view with changed rows of selected table , then the rows 
containing Tj in materialized view is removed and replaced with a row 
in which selected columns are set to NULL. INDEPENDENT CLAIMS are also 
included for the following: 

(a) database system; 

(b) incremental refresh program 

USE - For maintenance of materialized views that contain one-to-N 
lossless joins in database management system. 

ADVANTAGE - Since the incremental refresh technique does not 



require information about the order of updates to the base tables , 
the overhead associated with maintaining sequencing information is 
avoided. Since the technique is idempotent in performing an incremental 
refresh N-times on the same data, after a system crash, the incremental 
refresh operation is restarted from the beginning without taking into 
account how far the operation had progressed prior to the crash. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart that 
illustrates the incremental refresh operation. 
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Computer aided designing/computer aided machine conversion system e.g. 
for PCB fabrication, converts CAD system data to source table , inorder 
to link data edition number with comprehensive edition number 

Patent Assignee: FUJITSU LTD (FUIT ) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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NOVELTY - Data of each CAD system contains intrinsic format and 
o'aca edition number with format common to a CAD system. A converter 
converts CAD system data to source table comprising information 
regarding board, mounting, node, inorder to link data editing number 
with comprehensive edition number which are managed by a management 
unit . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 

following : 

(a) CAD/CAM conversion procedure; 

(b) recording medium with data conversion program 

USE - E.g. for printed circuit board (PCB) fabrication. 

ADVANTAGE - Automatic conversion is performed at mounting angle 
joined on output destination line as table maintains mounting 
angle information at both sides rotations for each component. Since 
data format of different CAD system is converted into an unification 
data format, each CAD data is managed with identical database . 

DESCRT-PT-ION- OF-DRAWiNG-(S-) — The -figure- shows-fire "diagram "~ 

illustrating components of CAD/CAM conversion system. 
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Access management method for shared resource in multiprocessing system, 
involves choosing subject processor to operate in DUAL mode or SOLO mode 
in response to information exchange attempt being success or failure 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 

Inventor: FREITAS R F; JADAV D; KENCHAMMANA-HOSEKOTE D; MENON J M; STRONG H 

R 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6105099 A 20000815 US 98203102 A 19981130 200063 B 
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NOVELTY - Each of the processors (102,104) sends self-state message 
repeatedly to other processor. The subject processor chooses operating 
mode in response to preset power-up conditions. Any information 
required to synchronize processor's lock tables is exchanged 
suitably during join operation. If attempted information exchange 
succeeds, subject processor is chosen to operate in DUAL mode, else in 
SOLO mode. 

DETAILED DESCRIPTION - The subject processor establishes message 
■■ :■]■.«'- ni. as birch cry message when lock tables (108,110) of subject 

ocessor requires information from other processor's lock table . The 
processor is operated in DUAL mode or SOLO mode only after completion 
ol join operation. When the processor in DUAL mode, the message content 
is established by processor as heart beat message. The processor 
combines with other processor to establish single processor locks on 
sub-parts of shared resources (106) to satisfy host requests to access 
the shared resource comprising digital data storage. When failure is 
indicated from other processor, the subject processor is operated in 
SOLO mode during which message content is established as death knell 
message. The processor attempts to acquire sub-part of shared resource 
and to store a predetermined code at each sub-part. The access to 
shared resource is enabled without consulting the other processor only 
:-■•/ rernent of sub-pares of shared resource. INDEPENDENT CLAIMS are 

j ucied for the following: 
access management program; 
;b) digital data processing system 

I'SE - For managing access to shared resources in multiprocessing 
systems used in data processing applications such as automated teller 
network, airline reservation systems, stock brokerage, etc. 

ADVANTAGE - Makes better use of high through-put shared resource by 
efficiently sharing the resources among hierarchically superior hosts. 

DESCRIPTION OF DRAWING ( S ) - The figure shows the block diagram of 
computer system with multiple processors. 

— Proce-s-S'ors— (tO-2-,-10^-) 

Shared resources (106) 
Lock tables (108,110) 
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Duplicate tuples elimination method in database management system 

Patent Assignee: SYBASE INC (SYBA-N) 
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Abstract (Basic) : US 5937401 A 

NOVELTY - The query for generating the tuple stream satisfying the 
selection criteria is executed, by scanning the selected database 
tables (250) according to the determined join order. The inner most 
table is scanned, thereby executing the filter which filters the 
duplicate tuples from the tuple stream. 

DETAILED DESCRIPTION - A received query specifies the selection 
criteria for the information of interest from the database system. 
The determined join order indicates the innermost and outermost 
tables of the selected join so as to guarantee that the tuples 
will stream in order during scanning of the query. A filter is ■ 
initialized at the outermost table for key columns to pass the 
initial tuple encountered from which an initial key is constructed. On 
execution the tuples having keys already encountered in the tuple 
stream are discarded by the filter attached to the innermost table . 

USE - For eliminating duplicate tuples in a generated tuple stream 
in a database management system. 

ADVANTAGE - The duplicate tuples are eliminated from the tuple 
!•>-■■•?*; without the need for performing an expensive sort operation by 
: : d-risc: ibed method. 

DESCRIPTION OF DRAWING (S) - The figure is a block diagram of a 
^ L i ent -server system with the duplicate tuples elimination method. 
Database tables (250) 
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Data processing system for accessing database - has catalogue analysis 
module which selects table within database , to create ID key for each 
set of columns in table identified as unique set 

Patent Assignee: ACTUATE SOFTWARE INC (ACTU-N) 
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NOVELTY - A dictionary builder (50) includes catalogue analysis 
:■ H ; (52) that selects table within the database , and creates ID 
r>*y i:o.r each set of one or more columns in the table identified as 
a unique set. A join key referencing selected table is created in 
any other table in the database having a set of columns with column 
names matching with that of ID key. DETAILED DESCRIPTION - A user 
interface (114) is coupled to a CPU (110). A dictionary buffer which is 
also coupled to CPU scans the database (130) to create dictionary 
containing attributes that define relationship between tables in the 
database and couples dictionary to the database . The table 
includes columns having column names. INDEPENDENT CLAIMS are also 
included for the following: information accessing method from database 
; dictionary builder. 

USE - For accessing information from relational databases . 

ADVANTAGE - Avoids need for extensive programming by system 
administrator to access database interactively. DESCRIPTION OF 
DRAWING (S) - The figure shows the block diagram of data processing 
system. (50) Dictionary builder; (52) Catalogue analysis module; (110) 
CPU; (114) User interface; (130) Database . 
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Joining method in database for one or more input tables comprising 
records stored in storage medium - using join index and minimizing number 
of input/output operations while maximizing use of small main memory 
through buffer allocation process based on join index entries 

Patent Assignee: UNIV COLUMBIA NEW YORK {UYCO ) 

Inventor: LEI H; ROSS K A 

Number of Countries: 021 Number of Patents: 002 
K: ! :er:t: Family: 
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The method involves allocating an array of partitions in a memory. 
Next a join index is read indicative of records to be joined. The join 
index entries are stored corresponding to each of one or more input 
tables of stored records and a corresponding partition identifier to 
temporary files associated with the allocated partitions. The index 
entries and the corresponding partition identifier in each of the 



temporary files are read in turn and each temporary file is sorted, 
fore ions of the tables are sequentially read only if the portion 
includes a record identified in the sorted temporary file. 

The read records are written in accordance with an order of the 
corresponding partition identifiers to separate output files associated 
with each input table . The allocation step equally distributes the 
records of the input table among the array of partitions. 

ADVANTAGE - Join technique performs join operation irrespective of 
join index order. Processes all input tables simultaneously . Can 
perform self- join operation without reading input table multiple 
times . 
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Aid preparation method for recognising database statement with data 
structure in memory device of computer system - involves determining 
ratio of number of distinct rows satisfying join statement in detail 
cable and master table for each directional link associated with 
detail and master table 

Patent Assignee: ORACLE CORP (ORAC-N) 

Inventor: TOW D S 

Number of Countries: 001 Number of Patents: 001 
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Patent No Kind Date Applicat No Kind Date Week 
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The method involves defining a set of nodes representing tables . 
A set of directional links are defined between pairs of nodes, each of 
which representing a master detail relationship between a detailed 
table and corresponding master table . A data structure referred to 
as a join free comprising representation of nodes and their 
directional links is defined in a memory device. A set of zeros or more 
selectivity factors are represented for each node. 

Each selectivity factor indicates expected fraction of rows in 
table represented by node that satisfies one or more logical 

_fpn.fi i r i nns _s at _£or-t-h- i-n -t-h e— data- acc ess- st at erne nt'— The~r at io o~f " Tiumber" 

"> ; distinct rows satisfying join statement in detail table to number 
o.i: distinct rows satisfying join statement in master table is 
determined for each directional link associated with detail table and 
master table . The probability of row in the detail table 
corresponds to that of rows in master table . 

ADVANTAGE - Improves statement execution efficiency. Produces truly 
optimise statements. 
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Query optimisation method for relation database - involves finding JOIN 
conditions to form chain to form tables according to graph join theory 
and reordering in FROM clause 
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Abstract (Basic) : US 5758335 A 

The query optimisation method involves examining the WHERE clause 
and finding all join conditions that are present. A chain of the join 
conditions is formed. 

A list of tables is established where i) tables that are in the 
WHERE clause but not in the join conditions are listed first; ii) 
tables in the join chains formed according to graph theory are listed 
next in the same order as they appear in the join chain such that 
the tables from the longest chains are listed before the tables 
; : o!n shorter chains and all tables at a given distance from a root 
table ol the chain occur together before the next level in the join 
ir. . Trie tables are reordered in the FROM clause in the list order. 

ADVANTAGE - Quickly finds access plan. Orders tables in FROM 
clause according to optimal join order in WHERE clause determined by 
graph theory. 
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GUI data access method for relational database - involves joining 
cables on common field using operator and defining logical schema 
defining table hierarchy 

Patent Assignee: INFORMIX SOFTWARE INC (INFO-N) 
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The data access method involves specifying at least one logical 
relationship by selecting a pair of tables having a common field from 
among the relational database tables . The common field of the pair 
of tables is selected as a join field . A join operator is 
selected for the join field to define a join between the pair of 
tables , the logical relationship being one of One-to-One, 
One-to-Zero-or-One, One-to-Zero-or-More , and One-to-One-or-More. 

A logical schema is created defining a multilevel hierarchy of the 
tables in the specified logical relationships. A first table and a 
second table of a pair of tables in a logical relationship are at 
an equal hierarchy level if the logical relationship is one of 
One-to-One and One-to- Zero-or-One . The first table is at a higher 
hierarchy level than the second table if the logical relationship is 
one of One- to-Zero-or-More One-to-One-or-More . 

ADVANTAGE - Avoids user having to understand physical database 
.schema to access required data. 
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Computer data base and retrieval method - analysing join statements 
using graphical technique to determine groups of tables represented as 
nodes 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC }; IBM CORP (IBMC ) 
Inventor: JACOPI T W 

Number of Countries: 004 Number of Patents: 005 



Patent Family: 














Patent No 


Kind 


Date 


Applicat No 


Kind 


Date 


Week 


FV 4 7 6810 


A 


19920325 


EP 91306546 


A 


19910718 


199213 B 


: ' 4 9 3 


A 


19940215 


US 90576022 


A 


19900831 


199407 




A3 


19931020 


EP 91306546 


A 


19910718 


199510 


; ; ; . "i j o 


Bl 


19981014 


EP 91306546 


A 


19910718 


199845 


30 350 


£ 


19981119 


DE 630350 


A 


19910718 


199901 








EP 91306546 


A 


19910718 





Priority Applications (No Type Date) : US 90576022 A 19900831 
Cited Patents: No-SR.Pub; l.Jnl.Ref 
Patent Details: 

-Pa-ten-t -No-— Kd-nd— Lan -Pg Ma~rn~~I PC Fi1Li~ng~Notes 

EP 476810 A 11 

Designated States (Regional) : DE FR GB 
US 5237493 A 10 G06F-015/40 

K.P 476810 Bl E G06F-017/30 

Designated States (Regional) : DE FR GB 
DE 59130350 E G06F-017/30 Based on patent EP 476810 



Abstract (Basic) : EP 476810 A 

Data is retrieved using a request including a set of joint 
statements, each linking the name of two tables . The data processor 
includes an element to assign priority to one table name in ech joint 
statement. An array of graph identifiers corresponding to the table 



ri^mes is formed. Each group identifier is initialised to have a value 
representing the coressponding table name. Each joint statement is 
processed in succession. 

The value of the priority name is substituted in the array of 
graph identifier values in place of each value representing the other 
name to derive an indication of the number of groups of linked names in 
the set of entered join statements. 

USE /ADVANTAGE - Assures coherency of join list without need for 
re-entering all of elements of join list 
Title Terms: COMPUTER; DATA; BASE; RETRIEVAL; METHOD; ANALYSE; JOIN; 

STATEMENT; GRAPHICAL; TECHNIQUE; DETERMINE; GROUP; TABLE ; REPRESENT; 
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Abstract (Basic) : EP 442684 A 

The method for joining selected data in two tables (12,10) in a 
relational data base system which involves defining a first 
parameter (35), selecting data in the first table which satisfies the 
first parameter and placing such selected data in order in a first sub- 
table (38). A second parameter is defined and data is selected in the 
second table which satisfies the second parameter. 

Selecting data which satisfies the second parameter involves 

dj2.f_Ln.ing- a. preliminary pa r-a meter— based- on -the— data ~in — th'e "first" sub~= 

table . Data in the second table is selected which satisfies the 
preliminary parameter and placed in order in a second sub table (45). 
Data in the second sub table which satisfies the second parameter is 
selected and combined with the data in the first sub table so as to 
join the selected data in the first and second tables (50) . 

ADVANTAGE - Highly efficient I/O operations. <14pp Dwg.No.5/5 
Title Terms: JOIN; SELECT; DATA; TABLE ; RELATED; DATA; BASE; SYSTEM; 
DEFINE; FIRST; PARAMETER; SELECT; DATA; FIRST; TABLE ; PLACE; SUB; 
TABLE ; DEFINE; SECOND; PARAMETER; SELECT; SECOND; DATA 
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Abstract (Basic): EP 421408 A 

The join operation is performed in three stages with an optional 
fourth stage, the details of which depend on the underlying join 
algorithm used. The first stage is a preparatory stage that provides 
data for the second stage to use as a basis for defining the subtasks 
for the final join operation and to allocate subtasks evenly to 
different processors, even in the presence of data skew. 

Once the second stage has completed its processing, the subtasks 
are shipped to their assigned processors for processing and the final 
join of the two relations in the third stage. Optionally, during the 
join in the third stage, there could be a dynamic reassignment of the 
subtasks should the join operation become unbalanced. 
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Abstract (Basic): US 4967341 A 

A set of sorted columns of one table stored in a DB machine 
is binary-searched by a hardware in synchronism with data transfer 
when another table is read from a disk unit to select a row to be 
joined . 

Where a key word sequence to be searched is stored in ascending or 
descending order, determination of a max. address in an area in which 
key words smaller than a search key are stored and determination of a 
max. address in an area in which the key words smaller than or equal to 
the search key are stored are in parallel executed to determine the 
storage range in one search. 

ADVANTAGE - Increases speed of join operation in relational data 
base machine . 
Dwg. 1/23 
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ABSTRACT 

PURPOSE: To greatly shorten join processing time by dividing a join 
processing into plural CPU for parallel execution. 

CONSTITUTION: The 1st table for execution of joins is decentralized to 
\ ) memories 7-10 and the 2nd table for joins is superposed on those 
:*•-■• -rif.s "--10 respectively. Here processors 3-6 perform the 1:1 comparison 
:> ■ w the partial records of the 1st and 2nd tables and store 
.-...toss i. vely the pairs of records satisfying the connecting conditions into 
■ T 1 prescribed areas of a common memory device 1. Thus a low table 
undergone the join processing is finally produced on the device 1 . As a 
result, the joint processing is possible in a short period without carrying 
out any sorting process in case one of both tables is especially large or 
smal 1 . 



14/5/5 (Item 5 from file: 347) 

DIALOG (R) File 347: JAPIO 

(c) 2004 JPO & JAPIO . All rts. reserv. 



!i!?2904 5 -''"Image available** 
SIMULTANEOUS JOINING PROCESSING SYSTEM FOR PLURAL ITEMS 



PUB. NO.: 58-166445 [JP 58166445 A] 

PUBLISHED: October 01, 1983 (19831001) 
INVENTOR (s) : OHARA TOSHISAKU 

APPLICANT (s) : FUJITSU LTD [000522] (A Japanese Company or Corporation), JP 
(Japan) 

APPL. NO.: 57-049456 [JP 8249456] 

FILED: March 27, 1982 (19820327) 

[NTL CLASS: [3] G06F-007/28 

.'APTO CLASS: 45.1 (INFORMATION PROCESSING — Arithmetic Sequence Units); 
4 5.2 (INFORMATION PROCESSING — Memory Units) 
■ ■ L:;a:.: Section: P, Section No. 246, Vol. 08, No. 1, Pg . 97, January 

06, 1984 (19840106) 



ABSTRACT 

PURPOSE: To make a line-by-line decision between acceptance and rejection 
uhrough <=2-times matching, by utilizing the previous sequencing of data in 
a table and to exclude lines after the decision from the matching and 
improve processing efficiency. 



CONSTITUTION: When an inquiry input for joint processing is received from a 
display terminal 5, relative tables are loaded from a data file 3 to a 
memory 2. Pointers A and B are initialized to indicate the uppermost lines 
of the tables . An arithmetic processing part 10 performs the matching 
between the tables A and B on the basis of retrieval conditions of shown 
elements. A pointer control part 11 increases the pointer A or B by one 



every time the matching is carried out once to indicate the next line. 
Records of a coincident line are put together in output format by an 
input/output processing part 12 and transferred to the terminal 5. 
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Selected table joins display method in relational database management 
system, involves forming line between icons representing selected tables 
in overview diagram upon accepting selected join grid row 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 
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Abstract (Basic) : US 20030055830 Al 

NOVELTY - A table join grid row having potential valid table 
join for two database tables represented in a join overview 
diagram, is selected from a grid. A line is formed between the icons 
representing the selected tables in the diagram, upon accepting the 
selected join grid row. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
: n !_ 1 owi nq : 

; .1 ; program storage device storing selected valid table joins 
; '. s c 1 a y program; and 

(2) selected valid table joins display system. 

USE - For displaying selected table joins in relational database 
management system (RDBMS) of computer. 

ADVANTAGE - The join grid and the join overview diagram are 
always displayed together and are synchronized . Hence, if a user 
chooses a join in the join grid, the join overview diagram 
automatically reflects the selection by drawing the line, which 
highlights the associated row in the join grid. Each row in the join 
qrrid automatically provides a summary of all valid two-column 
rombi na t ions of the selected two tables . Thus, the user need not 
.<-■•■•! ^ci: each of the two columns in the join grid separately. Provides 

ioient table join process even for a large number of tables and 
col umns . 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart explaining 
table joins display and selection process, 
pp; 11 DwgNo 5/5 
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Abstract (Basic): KR 2001107227 A 

NOVELTY - A method for managing advertisement information is 
provided to enable a client to get profit due to advertisement 
information, and to enable an advertiser to get more efficient effect 
oj advertisement with less cost by using quiz game. 

DETAILED DESCRIPTION - A service operator interface part (301) 
interfaces communication with a computer of a service operator ( 100 ) . An 
information classifying part (302) classifies advertisement information 
provided from an advertiser, quiz information provided from the service 
operator ( 100) , member information inputted from a client, an input 
information inputted correspondingly to the quiz by the client and 
information about right answerer respectively. Then, the information 
classifying part (302) stores the classified information in 
corresponding database (303-307) respectively. If a client joins as 
member, a part of requesting input of member information ( 308 ) requests 
• ' ■-]. ient to input various personal information including ID and 
: : -.-A ^rri. At the same time , the part of requesting input of member 

. ■ iiU; l i on ( 308 ) requests the client who joined as member to input 
■ :r' :\) and the password. An external interface part(309) interfaces an 

f-v- ratling server(300) with communication networ k { 500 ) . When a client 
registered as member connects to the operator server (300) and joins in 
quiz, a quiz information displaying part (310) extracts quiz information 
stored in a quiz information database (303), and provides the 
extracted quiz information to the client, 
pp; 1 DwgNo 1/10 
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Data retrieval optimization method for relational database management 
system, involves creating aggregate join index by combining commonly used 
columns of tables with results from computed aggregation expressions 
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inventor: AU G K; HOANG C K; ON AH G K 
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KP 1164509 A2 20011219 EP 2001304927 A 20010606 200212 B 
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/Abstract (Basic) : EP 1164509 A2 

NOVELTY - The aggregation expressions from the columns of tables 
are computed. An aggregate join index is created for the tables by 
combining commonly used columns of the tables with results from the 
expressions . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 

following 

(a) Computerized database management system; 

(b) Data structure; 

(c) Data retrieval optimization program; 

(d) Computer readable storage medium storing data retrieval 
optimization program 

USE - For mainframe, mini computer, personal computer for use in 
relational database management system (claimed) . 

ADVANTAGE - Allows users to perform aggregation operations reliably 
using the aggregate join index. Enhances parallel processing across 
multiple access module processors. 

DESCRIPTION OF DRAWING (S) - The figure shows the hardware and 
software environment. 
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Acs; i'fici- (Basic) : RD 439106 A 

NOVELTY - Relational and physical database design is driven by a 
database designer's heuristic knowledge of database design. The 
re'sent invention utilizes a set of DB2 designer rules derived for DB2 

_ ijni^ejrsal database (.UDB4 -Paxa-l-le-1— Ed-i-t-i-on— a-nd- BB2- UDB" physical 

database objects. 

DETAILED DESCRIPTION - These rules are useful to designers in the 
performance of various tasks in physical design, such as the creation 
of indexes to optimize the access path for minimal costs; the 
assignment of tables to appropriate tablespaces; or in the parallel 

database environment, table collocation, which will occur if two 
large sized tables aree involved in frequent joins . Each design 
advice action is connected to a set of heuristic rules. Through the 
graphical representation of DB2 UDB design rules users can access the 
sot: of rules and tailor the set or individual rules to meet the need of 
i heir database applications. Examples of rule sets include activate 
:. .:',us, deactivate rules and modify rules. A set of heuristic rules 



takes the provided data load and workload values when it generates a 
design proposal. If no information is available, the heuristic rule set 
will use predefined default values with its rulse. The following figure 
shows Creation of Index Rules using graphical representations. The 
advice generated using the rule set is given in the form of a report 
chat lists the design steps to consider. When design proposals are 
requested, one can either accept or reject a designn proposal. The 
lollowing two figures show proposed actions and a proposed report for a 
table object using graphical representations. 

USE - None given. 
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Edit log management system of engineering drawing, stores drawing joining 
management and authority registration tables based on whose contents 
joining of drawing data and editing authority are managed respectively 
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Abstract (Basic) : JP 2000298678 A 

NOVELTY - The data processor (1) produces 2D engineering drawing 
which is split in tree form and authority of editing are managed based 
on contents registered in drawing joining management table (28) and 
authority registration table (25). The editing of drawing is 
performed based on parent drawing attribute data (22) and contents of 
updating registration table (24). 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 

USE - for parallel editing and log management of engineering 
drawings produced during large scale construction project. 

ADVANTAGE - Since editing of drawing data corresponding to parent 
attribute data and updated data is performed and managed based on 
respective tables , editing is performed simultaneously and 
parallely by several operators, enabling drawing joining 
centralization unit to be set to the desire of user and preventing 
mist.akes_o.f_ editing, same drawi-ng— by ■- se-ve-ra-1— operator st ~ 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
edit log management system. 

Data processor (1) 

Parent drawing attribute data (22) 
i s t r a c ion tables (24, 25) 
; :w:r ( .j joining management table (28) 
l : ; o DwgNo 1/6 
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Query processing method for relational database management system, 
involves performing join operation of data streams locally, after 
detecting appropriate conjunct predicates 
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NOVELTY - Two partitioned data streams that relate to join 
operation, are received. Usage possibility of conjunct predicates to 
locally perform a parallel inner join or outer join operation, is 
determined. The join operation is performed locally, if the detection 
result approves the usage of appropriate conjunct predicates. 

USE - For parallel query processing in SQL processing relational 
database managements system (RDBMS) in computer network. 

ADVANTAGE - Enables to optimize or avoid data repart it ioning by 
recognizing the possible partitioning requirements for achieving 
parallelism for a query operation, and when the partitioning property 
of data satisfies the partitioning requirements of query operation. 

DESCRIPTION OF DRAWING (S) - The figure represents the query 
execution plan for directed join operation in DBMS. 
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Abstract (Basic) : US 5884320 A 

NOVELTY - The join operations are assigned to the processors 
using index structures. The data points are simultaneously 
redistributed and joined in the processors in parallel based on 
predetermined joining conditions. 

DETAILED DESCRIPTION - The data points among the processors are 
partitioned and an index structure for these data points is created. 
The index structure has several leaf nodes corresponding to a subset of 
;:di:a points. INDEPENDENT CLAIMS are also included for the following: 

(a) computer product; 

{b} database system. 

USE - In microprocessor system. 

ADVANTAGE - Builds index structure and performs joins efficiently 
using minimum amount of storage space due to system's parallelism. 

DESCRIPTION OF DRAWING (S) - The figure shows flowchart of overall 
operational sequence of performing join operations on data points of 
database in parallel . 
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Abstract (Basic) : GB 2287807 A 

The system joins distributed data with a join key and produces a 
joined table (100,200). Recording devices, disk drives (4a-4d) store 
the distributed data e.g. employee data (5a-5d) and sales data (6a-6d). 
Slave-processors (3a-3d) are coupled to the recording devices to 
retrieve the data and output the data. The main processor (1) receives 
the data from the slave-processors and produces the joined table . 

Each slave-processor checks a join key of the second data, sales 
ioi.d, with a join key of the first data . Based on the checking 
insult it selects the second data (400a-400d) and outputs the data to 
: he main processor. 



USE/ADVANTAGE - Provides high speed joining , eliminates burden on 
master processor as data distributed and stored in slave processors, 
part of join processing done in parallel . 
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Abstract (Basic) : GB 2235798 A 

A join processor for relational databases includes a main 
processor (1) for processing information. A main memory (2) is 
connected to the main processor to store information. A number of 
auxiliary processors (3) are controlled by the main processor. 

A number of auxiliary memories (4) are each connected to the 
corresp. auxiliary processors. A device enables the auxiliary 
processors to perform in parallel dividing and storing relations of 
relational databases in the auxiliary memories in units of record and 
extracting from the auxiliary memories and sorting join fields of the 
relations to be joined. 

USE - Fast join processor for a large size of relations to be 
joined, e.g., employee records. (21pp Dwg.No.1/6 
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51 7315 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (7N) (RO- 

W? ? OR TUPLE? ? OR COLUMN? ? OR FIELD? ? OR CELL? ? OR TABLE? 

?) 

52 1422 SI (7N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARAL- 

LEL OR SYNCHRONIZ? OR SYNCHRONIS? ? ? OR SYNCHRONOUS? OR SYNC OR 
SAME () TIME OR AS OR WHILE) 
! >49024 DATABASE? ? OR DATA () BASE? ? OR TABLE? ? 
13343 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (20N) (S- 
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HRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR SAME ( ) TIM- 
E) 

55 10265 REDISTRIBUT? OR RE ( ) DISTRIBUT? 

56 5 S2(50N)S3(50N)S5 

57 0 S3(50N)S4 (50N)S5 
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HILE OR AS) 

59 6 S3 (50N) S5 (50N) S8 

510 31724 6 ROW? ? OR TUPLE? ? OR COLUMN? ? 

511 18855 S10(5N) (REDISTRIBUT? OR DISTRIBUT? OR SHUFFL? OR RESHUFFL? 

OR SHIFT??? OR RESHI FT??? OR MOV??? OR MIX??? OR REORDER??? OR 

SORT??? OR RESORT??? OR REARRANG? OR REORGANI? OR REGROUP??? 
OR RE ( ) (ARRANG? OR ORGANI? OR GROUP??? OR ORDER?)) 
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OR SAME () TIME OR AS OR WHILE) 

§1 3 ' ,• 15 f S12 (50N) Sll (50N) S3 
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51 6 4 S15 NOT SI 3 

517 1238781 DATA OR INFORMATION OR FIELD? ? OR CELL? ? 

518 149099 S17 (5N) (REDISTRIBUT? OR DISTRIBUT? OR SHUFFL? OR RESHUFFL? 
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SORT??? OR RESORT? ?? OR REARRANG? OR REORGANI? OR REGROUP??? 
OR RE() (ARRANG? OR ORGANI? OR GROUP??? OR ORDER?)) 
319 33 S12 (50N) S18 (50N) S3 AND IC=G06F 
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. SPECIFICATION that such attributes were at some point part of a 
candidate key (since they were originally associated with data elements 
hf.: vine: Value ( underscore) 1 that have subsequently been changed to 
'W; lue ( underscore ) 3 as a result of a join operation). As explained below, 
ihis change from Value ( underscore ) 1 to Value (underscore ) 3 was made due to 
the fact that the join may cause duplicate values to occur in the 
result, thereby rendering the candidate key to no longer be unique. 
However, after the group by, duplicates are removed and the candidate key 
is again unique for each row. 

If neither condition 512 nor 516 is satisfied, then the optimizer 
module 20 sets ( at 520) the data elements of all attributes (the 
grouping fields) to Value ( underscore ) 2 . In effect, as part of the 
partial group -by operation on a table T, the grouping fields are set 
to either Value ( underscore ) 1 or Value (underscore ) 2 , depending on the 
conditions noted above. 

The optimizer module 20 also updates (at 522) the data structure 401 in 
■'■ sr\rise to a join of two tables (table Tl and... 
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..SPECIFICATION facilities 1201-4)) by the parsing engine 130 (not shown). 
For example, two columns 210, 220 can be designated as the primary index 
when the table is created. The hash function is then applied to the 
contents of columns 210, 220 for each row. The hash bucket portion of the 
resulting . . . 



..number and the hash function is the sum of remainder when the row number 
is divided by four and the value one, the first eight rows will be 
distributed as shown in Fig. 2. 

Queries involving the values of columns in the primary index can be 
efficiently executed because the processing module llOn) ) having... 
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.SPECIFICATION the further Group-by on A of the Group-by on B has an 
identical result of group-by A. Mathematically, if t is a table , then 

Group-by on B (t) (contains subset) Group-by on A(t) . 

Group-by on A (Group-by on B (t) ) = Group-by on A(t) . 

The Group-by on B groups or sorts all rows using attributes in B 
before an aggregation is performed. Since B is larger than A, the set of 
rows with a ...is the subset of rows with the same constant in A. Hence, 
group-by B is a finer partition of group-by A. 

Given two tables tl and t2, the following proposition (referred to 
as "Proposition 1") is correct: 
(el x " t2) ' - (tl* x t2' ) ' 
[i J x c2) ' - (tl 1 x t2) ' 

This provides flexibility in how the tables are joined to achieve 
'he- final join result. The optimizer module 20 can thus choose among 
pLitiral join paths that involve Groupby tables . 

Given a table t, a Group-by on attribute x is a partition of the rows 
into the classes of the same value in x. If the' table... 
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... SPECIFICATION array (see symbol 2401 of Fig. 24), the number of record 
numbers starting from what position that are associated with record 
riurnbers in the master table can be found from the count and start 
position, respectively. To wit, the aforementioned position matches the 
position indicated by the count and the aforementioned. . . 



..total and start position looked up in Step 2703, and in the array of 
pointers to the value list within the information block including the 
fields to be displayed in the joined table (view) , the pointer 
values indicating the various record numbers are fetched and rearranged 
in order as a new pointer array (Step 2703) . Thereby, among the group 
of information blocks to be joined , those to be derived from the master 
table are complete upon their creation. 
In Fig. 28, regarding record number "0," "0" is the pointer value to 
• "v ^spending position (row 1) within the... 

. : -..-re fetched. From these values, one can see that the "2-0=2" record 
nurnoers "1" and "3" from the position corresponding to "0" ( row 1) in 
the sorted set ( sort array) are the record numbers of the slave table 
associated to record number "0" of the master table . Accordingly, on 
the slave table side, the pointer values of "0" and "1" are fetched 
sequentially from the positions indicated by the record numbers "1" and 
"3" (row 2 and row 4) within the information block for the "Game date" 
field to be displayed in the table (view) , and these are placed as 
new pointers in the value list of the information block for "Ticket/Game 
date" which is an information block for the joined table (view) . Note 
that the value list contained in this information block for "Ticket /Game 
date" is identical to the value list contained in the information. . . 
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...SPECIFICATION of the outer table for rows which satisfy the local 

condition; then, for each of these rows, a complete join column scan of 
■ inr.ei: Table 10 or a probe via an index on the join column is made 
Linn matching records. 
The sort/merge join of the prior art is illustrated in Figure 3 of the 
accompanying drawings. A pre-condition of this technique is that the 
outer and inner tables be ordered based upon their join columns, which 
enables the join to be done similarly to the MERGE phase of a SORT 
routine. Relatedly, a table can be ordered by sorting, or reached 
through an index to provide access in join column sequence. Local 
predicates on each table are applied before a sort, as illustrated in 
Figure 3. Thus, the Table 20 is built from the Table 12 by sorting 
the manager = Davis rows in order by employee number magnitude. 
Similarly, a sorted Table 22 is derived from the skill Table 10 by 
selecting skill table records having "test" in their skill fields and 
ordering those records by employee number. Once join column order is 
imposed on the tables , the join is done very efficiently using the join 

column sorted structures. Thus, the join begins by scanning the join 
column of the Table 22 with the employee number 53, corresponding to the 
first entry in... 

...and the scan of the join column of the Table 22 is picked up from where 
the last scan stopped. In this manner, the join columns of the sorted 
Tables 20 and 22 are each scanned only once in effecting the join of the 
tables. In comparison, the nested loop join procedure requires that... 

...table row is found satisfying the outer table local predicate or 
predica t es . 

The nested loop join technique makes efficient use of an index on the 
join column of the inner table . The nested loop join technique is 
; j': when the join column values passed to the inner table are in 
:'<-'•' :;;n and the join column index of the inner table is clustered, 
^nd when the number of rows retrieved in the inner table by finding 
inarching values in the index is small. As is known, an index is 
clustered when the rows of the table are, for the most part, stored in 
the same physical sequence as the sequence of key values. 

The drawbacks of the nested loop join are... scan on T2 is open, Then 
Close it. 

If scan on intermediate result table is open, then 

The scan on the index of Table 10 is discussed in greater detail 
below. This scan progresses sequentially through the index on the join 
column of the Table 10 to retrieve the... 



. RID's of the Table 10 rows in an index format having the form (key 
v-:! .1 ue : RI D list) . Since the scan is keying on the join column of 
Table 10, the key value is the value of the join column of an outer 
table row , while the parameter "RID list" is a list which identifies 
table rows which have that key value. 
The procedure 35 in Figure 4 sorts the rows of Table 12 by the 
local predicate or parameter (manager = "Davis") and orders the 
corresponding records by their join column values. The outcome of the 



process is represented by the Table 38. 

Returning to the open cursor process of Flow Chart I, a DO loop is 
opened in step 102 which is continued until the end of file (EOF) 
.indicators are encountered in Table 12. In the procedure, for each of 
che rows in the outer table , Table 12, which satisfies the predicates 
or parameters which are local to the outer table, the RID list is 
obtained from the index on the Table ... illustrates the embodiment of the 
invention described above and embodied in Flow Charts I-III. One 
principal precondition of the embodiment is that the outer table be 
ordered by sorting or indexing on the join predicates or table space scan 
with a well-clustered index on the join column. In Figure... 

.is not limited to ordering the outer table by the procedure 35. As is 
known, ii an index exists on one or more of the join columns of the 
<..uiy.r table , ordering is implicit in the index. 

The technique described also presupposes ordering of the inner table 
by indexing on its join columns. Again, indexing implies ordering and 
supports sequential access to the rows of the table if the RID's are 
ordered. 

Thus, using the rows of the outer Table 12, ordered and sorted by 
the join column , the technique described takes, as an index key 
value, the employee number of the first record, which is 53. This key 
value is used to scan the index of Table 10. In this regard, the scan 
on the index of Table 10 starts at the set (1:9) and scans from there 
down to the set (53:1,11) . The list (1,11) is fetched from the set and 
two composite rows are built and placed in the Table 45. When placed in 
the table , the rows are ordered, so that the first row is that denoted 
:*y "he RID value of 1 and the second by the RID value... 
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. SPECIFICATION data skew by partitioning the join operation into separate 
jobs and optimally scheduling the jobs among a plurality of processors. 

A common operation in relational database systems is the natural join 
of two relations on respective columns defined over a common domain. See, 
for example, the description of the natural join. . . 

. row is the concatenation of two rows, one from each of the original 
relations, such that both rows have the same value in their respective 
join columns . 

One popular algorithm for computing the join of two relations is the 
sort-merge technique as described by M.Blasgen and K. Eswaran in 
"Storage and Access in Relational Databases", IBM Systems Journal, vol. 
4, pp. 363 et seq. (1977). It can be summarized briefly as follows: 
First, each of the relations is sorted (if necessary) according to the 
join column . Second, the two sorted relations are scanned in the 
obvious interlocked sequence and merged for rows which have equal 
values. 

When sort -merge joins are performed in parallel on a multiple 
nro-esso.r database system, there exists a problem of data skew that might 
■ ■; s- : n *:he join columns of the relations. In general, the issue of 

--'.•■'a : r.oi addressed by the join algorithms described in the literature. 

: . • 1 a r J y article on parallel... 

.runs from disk. The merge tree is mapped to different processors with 
the final merge being sequential. 

In "Join and Semi join Algorithms for a Multiprocessor Database 
Machine", ACM Trans, on Database Machines, vol. 9, no. 1, March 1984, pp. 
133-161, P. Valduriez and G. Gardarin describe the algorithm generalized 
to. . . 
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Detailed Description 

. . . can be a table scan, a full index scan or a partial index scan for 
example . 

For a join statement that joins more than 2 tables , the query optimizer 
chooses which pair of tables is joined first and then which table is 
joined to the result, and so on. The query optimizer then chooses an 
operation to use to 1 5 perform the join operation. 

::: row set, the inner row set is accessed to find all the matching rows 

join . Therefore, in a nested loop join , the inner row set is 
: : :<vssed as many times as the number of rows in the outer row set. 

In a sort merge j oin, the two row - sets -being j oined are sorted 
by the j oin keys if they are not already in key order. 

In a hash join, the inner row set is hashed into memory, and a hash 
table is built using the join key, which is the probe key for the join 
operation. Each row from the outer row set is then hashed, and the hash 
table is probed to join all matching rows. If the inner row set is very 
large, then only a portion of it is hashed into memory... the optimizer 
decides 

that the amount of data is large enough to warrant a hash join, or it is 
;:i;abJe to drive from the outer table to the inner table . The outer 
■ . (with preserved rows) is used to build the hash table, and the 

iruiv'f cable is used to probe the hash table. 

.■->r" merge j 6 ins can be used to join rows from two independent 

.so j rces . 

Sort merge joins are useful when the join condition between two tables 
is an inequality condition (but not a nonequality) like <, <--, >, or >=. 
In a merge join , there is no concept of a driving table . This type 
of join operation may optionally be perfon-ned as follows. 

1 5 1. Sort join operation: Both inputs are sorted on the join key. 

2. Merge join operation: The sorted lists are merged together. 

M the input is already sorted by the j oin column , then a sort j 

o i n 

operation is not performed for that row source. 

Sort merge outerjoins are used when an outerjoin cannot drive from the 
outer table to the inner table. 

A 
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retailed Description 

. . . To obtain che concents of the change table 223 in the order in which 
• :■ ;;: or t i!. ion originally occurred, a database join operation between the 
■\ table 223 and che cransaction 240 can be used. For example, one 

i ;;ip 1. ementa tion of the present invention may use the following SQL join 
operation, in which change table 223 has source table columns 231 
named CI, ...that all SQL statements within a given 

transaction have the same value for a transaction identifier and that 
this value was stored in the transaction table 240 in the same row as 
the commit system change number 243. Therefore a join across the two 
tables matches up all SQL statements belonging to a particular 
transaction with their associated commit system change 

number 24 3. Moreover, the SQL ORDER BY clause returns the change rows 

in 

increasing sorted order, according to their commit system change 
number, which is to say, in the original order in which the transactions 
commie ted. 

Moreover, the end user or the subscriber application 121 need not be 
aware of the fact that there is a join between the change table (that 
does not contain the commit system change number) , and the transaction 
table (that contains the commit system change number and little else) . 
The feature may be attained by providing a subscriber view 251 , 
generated on behalf of... 
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Detailed Description 

. . . for the Parity column, the "FRENCH" value corresponding to the record 
is the value in the corresponding cell of the FRENCH column in the value 
table . In the example, the entry in row 5 of the Parity column of the 
: issuance table is associated with the record being reconstructed. Thus, 
i he "French" value is found in row 5 of the "French" column of the value 
table , whose value is "Trois". 

Alternatively, an unsorted column may be included in the data structures 
of the present invention by using the identity permutation as the 
permutation for that column (i.e., the value table for that column 
will not be reordered in any way) . 
1 5 

Column Merge Compression 
In accordance with a farther embodiment of the invention, separate value 

table columns can be merged into a single column, referred to herein 
as a "union column, " with separate displacement list columns for each of 
the original columns. This has the potential advantages of having a 
smaller value table , pre- joined data expediting join operations and 
improved update speed. A value not present in a particular original 
column is indicated in the displacement table column by a null range 
for that value. For example {assuming a "first row number" format 
displacement table ) , if the original column did not have the value at 
rcwYof the merged column, the displacement table for that column would 
have the same value at row Y and row T+ 1' {that is Displacement - Table 
( r+1 , c ) Displacement -Table ( r, c) =0) . If Y is the last row in the column, 
its value is set to a number greater. . . 
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De tailed Description 

... 0, "Country" 312, and "Language" 314 columns. 

All of the above Figure 2 and Figure 3 elements discussed are real 
elements . 

They exist inside relational tables managed by the system. The system 
also provides for other types of identifiers, namely virtual elements. 
Virtual elements are identifiers that are not real elements... 
...for example, population", "age", and "religion". The second category of 
virtual element, complex virtual elements, represent information that is 
computed by the system from table join operations or other types of 
r.: r i. t n rne tic processing. 



example, the "cast list" 914 associated with a movie, such as the 
movie titled "Rain Man", can be queried like an attribute of a particular 
movie. No such attribute exists as a real element inside any of the 
movie related tables of the illustrative system I 00. The system 1 00, 
can perform a table join of the Movie Information Table 202 and the 
Movie Personnel Table I I 00 to match a movie identifiers in the 
columns 204 and I 1 04 from each table. 

Figure 1 1 is a block diagram depicting a Movie Personnel Information 
Table I I 00. The role identifier column 1 102, lists role identifiers 
'■hat identify roles associated with various movies as indicated by the 
movie identifiers residing in the same row and I 0 listed in the 
movie identifier column 1 104. The movie identifier column II 04 
lists identifiers to various movies . The person identifier column 1 1 
06 lists person identifiers of persons associated with movie roles. The 
role type column 1 1 08 lists the types of roles, such... 
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Detailed Description 

... the function is computed using all preceding rows. The first two rows 
in the table or group however will have the NULL value. 

TRANINT Results Table 

This table supports the TRANINT function that derives counts, amounts, 
percentage means, and intensities from a transaction summary file. This 
function takes as parameters a... 

...calculate the average periodic transaction counts and transaction 

::-^ , :r. T . s by customer and transaction type (such as debit or credit) from a 

■ r-j r. s rt ; ; i \ o n s u mm a r y table. 

Dai: a Reorganization - Results Tables and Column Definitions 

The following describes the results tables and column definitions for 

the Data Reorganization functions. 

JOIN Results Table 
This table supports the JOIN function that joins tables together 
into a 

combined result table . This function takes as parameters a list of 
tables, keys, and column lists to combine new derived variables into an 
analytic data set, prior to building a matrix or... 

. . .be requested, which returns rows for all key column values found in the 
i:irsc table specified, and fills in any missing values from the other 
tables with null values. 



DENORM Results Table 

This table supports the DENORM function that selects or creates a new 
denormalized table. This function takes as parameters a table name, the 
name of key column. . . 
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: ;t? t; a i led De s c r i pt i on 

. . . tor the Parity column, the 

" FRENCH " value corresponding to the record is the value in 
the corresponding cell of the FRENCH column in the value 

table . In the example, the entry in row 5 of the Parity 
cclumn of the instance table is associated with the record 
being reconstructed. Thus, the "French" value is found in 
row 5 of the "French" column of the value table , whose value 
is "Trois". 

Alternatively, an unsorted column may be included 
in rhe data structures of the present invention by using the 
l-i^riticy permutation as the permutation for that column 
. , the value table for that column will not be reordered 

, \j i n any way) . 

Column Merge Compression 
In accordance with a further embodiment of the 

invention, separate value table columns can be merged into a 
single column, referred to herein as a "union column, " with 
separate displacement list columns for each of the original 
columns, This has the potential advantages of having a 
smaller value table , , pre- joined data expediting join 
operations and improved update speed. A value not present in 
a particular original column is indicated in the displacement 
table column by a null range for that value, For example 

(assuming a "first row number" format displacement table ), if 
the original column did not have the value at row Irl of the 
merged column, the displacement table for that column would 

•)5 have the same value at row Irl and row Ir+11 (that is 
Displacement- Table ( r+1 , c ) -Displacement Table (r, c) =0) , If Irl 
is the last row in the column, its value is set to a number 
- 32 . . . 
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TableDef , 

} f IsNumberOrEnglishLetter = tblFrorn As Table, tblTo As Table, 
! LsEnqlishLetter (Character ) Or FieldNot ForCompare ( As String) As 
i f 1 sNumber (Character) Integer 
End Function 

Dim FieldCount As Integer 

Function IsInArrayStr (ByVal aStr As Dim FieldNumber As Integer 
String, aArray( As String) As... 
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Claim 

. . . they are executed during an initial set-up phase of query execution. 
They fall into two general groups: DDL set-up queries to create temporary 

tables or indexes; and DML set-up queries, which could be used in 
multi-stage execution strategies to populate temporary tables with 
intermediate results. Potentially, a redundant sorting of the non-driving 
t^ble in the join by each parallel subquery, either by pre-sorting or 
by preindexing the non-driving table. If pre-sorting is used, only those 
rows which satisfy singletable predicates are inserted in a temporary 
table , which is indexed on the join columns , and 
the temporary table replaces the original table in t 

he FROM clauses of the parallel 

subqueries. If pre-indexing ' is used, the entire table must be indexed 
on the join columns . Either way, the resulting table can now be 
used as the inner table in a nested loops join . Any set-up queries 
which are generated as part of the transformation of a given query must 
be executed to completion before proceeding with execution... 

. . . • remaining query types, and could concept ua J. ly be_p_e.r formed- in - — — 

__L^- -J-I<-J— w-iXh it-. ~~ ~ ' 

' ; ( >,j n- up Que ries 

t:;;r each set-up query which creates a temporary table or index, a 
corresponding clean-up query is required to dispose of that temporary 
object. Clean-up queries are generated at the same time set... 
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' ;v-s-" r i pt ion 

... t riv present invention, the smallest constituent parts are row sources. 
•\ rcw source is an ob j ector iented mechanism for manipulating rows of data 
in a relational database 

system (RDBMS). A row source is implemented as an iterator. Every row 
source has class methods associated with it {e.g., open, fetch next and 
dose ) . 

Examples of row sources include: count, filter, join , sort , union, 
and table scan. 

Other row sources can be used without exceeding the scope of the 

present 
I nven u. ion . 

As a result of the compilation process, a plan for the execution of a 

query 

is generated. An execution plan is a plan for the execution... 
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Set Items Description 

51 4083 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (7N) (RO- 

W? ? OR TUPLE? ? OR COLUMN? ? OR FIELD? ? OR CELL? ? OR TABLE? 
?) 

52 365 S1(7N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARAL- 

LEL OR SYNCHRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR 
SAME () TIME OR AS OR WHILE) 

53 1221004 DATABASE? ? OR DATA {) BASE? ? OR TABLE? ? 

54 4190 (JOIN OR JOINS OR JOINED OR JOINING OR EQUIJOIN???) (20N) (S- 

1MULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARALLEL OR SYNC- 
HRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC OR SAME () TIM- 
ID 

824 SI (25N) (SIMULTANEOUS? OR CONCURREN? OR COINCIDENT? OR PARA- 
LLEL OR SYNCHRONIZ? OR SYNCHRONIS??? OR SYNCHRONOUS? OR SYNC - 
OR SAME () TIME OR AS OR WHILE) 

56 95207 REDISTRIBUT? OR RE ( ) DISTRIBUT? 

57 50 S4 : S5 AND S6 

58 36 S3 AND S7 

59 . 21 RD (unique items) 
Sl'b 17* S9 NOT PY=2002 : 2004 

511 83 S2 AND S3 

512 1814 3573 ROW? ? OR TUPLE? ? OR COLUMN? ? OR FIELD? ? OR CELL? ? OR - 

DATA OR INFORMATION 
S;3 590701 S12 (5N) (REDISTRIBUT? OR DISTRIBUT? OR SHUFFL? OR RESHUFFL? 

O.R .SHIFT-??-? -OR-RESH-I-FT-??-?-OR-MOV?-??-OR- MIX"???" OR "REORDER?'??" OR 

SORT??? OR RESORT??? OR REARRANG? OR REORGANI? OR REGROUP??? 
OR RE ( ) (ARRANG? OR ORGANI? OR GROUP??? OR ORDER?)) 

514 15 Sll AND S13 

515 10 RD (unique items) 

516 29 S5 AND S3 AND S13 

517 19 RD (unique items) 

518 19 S15 OR S17 

519 11 S18 NOT (S10 OR PY=2002 : 2004 ) 
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Abstract: This paper describes PBSM (Partition Based Spatial-Merge) , a 
new algorithm for performing spatial join operation. This algorithm is 
especially effective when neither of the inputs to the join have an index 
on the joining attribute. Such a situation could arise if both inputs to 
the join are intermediate results in a complex query, or in a parallel 
environment where the inputs must be dynamically redistributed . The PBSM 
algorithm partitions the inputs into manageable chunks, and joins them 
•jsing a computational geometry based plane-sweeping technique. This paper 
-ol.so presents a performance study comparing the traditional indexed nested 
loops join algorithm, a spatial join algorithm based on joining spatial 
indices, and the PBSM algorithm. These comparisons are based on complete 
implementations of these algorithms in Paradise, a database system for 
handling GIS applications. Using real data sets, the performance study 
examines the behavior of these spatial join algorithms in a variety of 
situations, including the cases when both, one, or none of the inputs to 
the join have an suitable index. The study also examines the effect of 
clustering the join inputs on the performance of these join algorithms. The 
performance comparisons demonstrates the feasibility, and applicability of 
the PBSM join algorithm. (Author abstract) Refs. 
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Journal Announcement: 9609W2 

Abstract: The transitive closure operation is an important extension to 
relational algebra. Because of its high computation cost, it is of great 
interest to design efficient parallel algorithms for computing the 
transitive closure in relational database systems. In this paper, we 
present a new algorithm to compute transitive closures on SIMD meshes based 
on relational algebra operations. Double-hash distribution is used to avoid 
rehashing new tuples for the next join phase. There presently exists no 

■ ra step for the redistribution of these tuples . Possible redundant 
• ■:: ,' r." :-in between different join phases has been prevented without 

j ! oba J operations. As only regular linear communication occurs on 
! :.*- mesh, and the workload is fully distributed, a speedup of 0(n 

u i r ■plied by n) has been achieved, where n multiplied by n is the size of 
;iiosh. Therefore, this algorithm is an optimal parallel version of the 
transitive closure algorithms based on relational algebra operations on 
SIMD meshes. (Author abstract) 28 Ref s . 
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Abstract: An important aspect of database processing in parallel 
computer systems is the use of data parallel algorithms. This paper 
presents several parallel algorithms for the relational database join 
operation in a hypercube multicomputer system. The join algorithms are 
classified as cycling or global partitioning based on the tuple 

i s r. r i bu t ion method employed. The various algorithms are compared under a 
-cmrnon framework, using time complexity analysis as well as an 
implementation on a 64 node NCUBE hypercube system. In general, the global 
partitioning algorithms demonstrate better speedup. However, the cycling 
algorithm can perform better _t ha n_t he -global -a 1-go-r-ithms- in specific 
situations, viz, when the difference in input relation cardinalities is 
large and the hypercube dimension is small. We also study the usefulness of 
the data redistribution operation in improving the performance of the 
join algorithms, in the presence of uneven data partitions. Our results 
indicate that redistribution significantly decreases the join algorithm 
execution times for unbalanced partitions. (Author abstract) 21 Refs. 
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A: :s ' rar : Distributed memory architectures, specifically hypercubes, for 
1 c*l database processing are treated. The cube interconnects 
.^-efficient data combination for the various database operations, 
nuo nonuniform data distributions are handled by dynamically 

redistributing data utilizing these interconnections. Selection and scalar 
aggregation operations are easily supported. An algorithm for the join 
operation is discussed in some detail. The cube is compared with another 
multicomputer database machine, SM3, and the performance of the join 
operation in these systems is described. The join performance in a cube is 
comparable to that in SM3 even when the cube is assumed to have a 
nonuniform data distribution. 15 refs. 
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Abstract: Strategies for performing database operation in a 
cube-connected multicomputer system with parallel I/O are presented. The 
cube interconnection subsumes many other structures such as the tree, ring, 
etc. This property is utilized to efficiently support database operations 
such as select, aggregate, join , and project. The strategies presented 
are unique in that they account for the nonuniform distribution of data 
across parallel paths by incorporating data redistribution steps as 



part of the overall algorithm. The two main data redistribution 
operations used are tuple balancing and merging. 17 refs. 
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The research question in this thesis concerns how to parallelize the 
spatial range and join query processing in order to support a high 
pt i: i:o trrnance spatial database application. Data partitioning for the range 
query operation involves declustering of spatial data, while data 
partitioning for the spatial join involves clustering of spatial data. If 
the static partitioning methods fail to equally distribute the load among 
different processors, the load-balance may be improved by redistributing 
parts of the data to idle processors using Dynamic Load-Balancing (DLB) 
techniques . 

In this thesis, we provide a framework for declustering collections of 
extended spatial objects by identifying the following key issues: (i) the 
work- Load metric, (ii) the spatial-extent of the work-load, (iii) the 
- • i s ■ r i bu t i on of the work- load over the spatial-extent, and (iv) the 

method. We identify and experimentally evaluate alternatives 
: • ■ j ; ■ . : these issues. 

: : ,j;Jdi Lion, we also provide a framework for dynamically balancing the 
t j ween different processors. We experimentally evaluate the proposed 
-..ste ring and load-balancing methods on a distributed memory MIMD 
machine {Cray T3D) and shared-memory machine (SGI Challenge) . Experimental 
results show that the spatial-extent and the work-load metric are important 
issues in developing a declustering method. Experiments also show that the 
replication of data is usually needed to facilitate dynamic load-balancing, 
as the cost of local processing is often less than the cost of data 
transfer for extended spatial objects. In addition, we also show that the 
effectiveness of dynamic load-balancing techniques can be improved by using 
declustering methods to determine the subsets of spatial objects to be 
transferred during run-time. 

A spatial join is often performed in two steps: a filter step and a 
_re_f in ejoenjL s_t-e.p_. _I.n._ this— thesis-, -we - focus -on— the -refinement" step "of ~the — ~~ 
spatial join. The refinement step of the spatial join takes as input a 
sequence of pairs of tuples and checks each tuple to see if the join 
predicate is satisfied for that tuple . This is similar to the join 
index processing done in traditional relational databases . We develop 
min-cut graph partitioning based methods for join processing using a join 
index. We use min-cut graph partitioning as a new heuristic for solving the 
page access sequence problem for fixed size buffer in sequential systems. 
We show that the number of page accesses needed to compute a join using 
join index in a fixed buffer environment is bounded by the sum of sizes of 
the base relations and the size of the cut-set of the page connectivity 
;raph. Since the min-cut graph partitioning aims to minimize the size of 
* -ser, this proposed heuristic is a direct method. Experiments with 



benchmark data sets show that the graph-partitioning based heuristic 
outperforms the existing methods, particularly when join selectivity is 
high and buffer space is small. (Abstract shortened by UMI . ) 
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Abstract: Known parallel DBMS offer at present only static partitioning 
schemes. Adding a storage node is a cumbersome operation that typically 
requires the manual data redistribution . We present an architecture 
termed AMOS-SDDS for a share-nothing multicomputer. We have coupled a 
high-performance main-memory DBMS AMOS-II and a manager of Scalable 
Distributed Data Structures (SDDS) into a scalable distributed system SDDS 
provides the scalable data partitioning in distributed RAM, supporting 
parallel scans with function shipping. AMOS-SDDS couples both systems using 
'hf* AMOS-II foreign function interface. Its scalability abolishes the 
: .::::r i • rsome storage limits of a single site RAM DBMS technology. Its 
ribuced RAM query processing and scalable data partitioning is an 
improvement over the current parallel DBMSs technology. We validate 
AMOS-SDDS architecture by experiments with distributed nested loop join 
queries over a file scaling up to 300,000 tuples. It includes performance 
study of speed-up and scale-up characteristics. The results encourage the 
use of SDDS for high-performance database systems. (19 Refs) 
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Abstract: Join is an expensive and frequently used operation whose 
parallelization is highly desirable. However, the effectiveness of 

parallel joins depends on the ability to evenly divide load among 

processors. Data skew can have a disastrous effect on performance. Although 
many skew-handling algorithms have been proposed, they remain generally 
inefficient in the case of multi- joins due to join product skew, costly 
and unnecessary redistribution and communication costs. A parallel 

join algorithm called fa- join was introduced in an earlier paper with 
deterministic and near-perfect balancing properties. Despite its 
advantages, fa- join is sensitive to the correlation of the attribute value 

I . l;; i ibiM'ions in both relations. We present an improved version of the 

= . : i i hm called Sfa-join with a symmetric treatment of both relations. Its 
; !■;: ;. "i.ab.L y low join-product and attribute-value skew makes it suitable for 
■ >:d use in multi- join operations. Its performance is analyzed 
'. :\e< •;: ec ica liy and experimentally, to confirm its linear speed-up and its 
superiority over fa-join. (16 Refs) 
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Abstract: For complex queries in parallel database systems, substantial 
amounts of data must be redistributed between operators executed on 
different processing nodes. Frequently, such intermediate results cannot be 
held in main memory and must be stored on disk. To limit the ensuing 
performance penalty, a data allocation must be found that supports parallel 
I/O to the greatest possible extent. In this paper, we propose declustering 
even self-contained units of temporary data processed in a single operation 
(such as individual buckets of parallel hash joins } across multiple 
disks. Using a suitable analytical model, we find that the improvement of 



parallel I/O outweighs the penalty of increased fragmentation. (14 Refs) 
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Abstract: We investigate various load balancing approaches for hash-based 
join techniques popular in multicomputer-based shared-nothing database 
systems. When the tuples are not uniformly distributed among the hash 
buckets, redistribution of these buckets among the processors is 
necessary to maintain good system performance. Two recent load balancing 
techniques which rely on sampling and incremental balancing, respectively, 
have been shown to be more robust than conventional methods. The comparison 
: • hnse two approaches, however, has not been investigated. In this study, 
;:r,prove these two schemes and implement them along with a conventional 
:: ■ ■ i and a standard join technique which does not do load balancing on 
-i;; nCUBE/2 parallel computer to compare their performance. Our 
experimental results indicate that the sampling technique is the better 
approach. To further evaluate the performance of these techniques under 
diverse hardware conditions, we also develop a cost model and implement a 
simulator to perform sensitivity analyses with respect to various hardware 
parameters. The simulation results show that both sampling and incremental 
techniques provide noticeable savings over conventional methods, with the 
sampling approach being more scalable in supporting very large database 
systems. (26 Refs) 
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Abstract: A multiple-weighted-tree based query optimization method for 
parallel database systems is proposed in this paper. The method consists 
of a multiple-weighted-tree based parallel query plan model; a cost model 
for parallel query plans, and a query optimizer. The parallel query plan 
model is the first one to model all basic relational operations, all three 
types of parallelism of query execution in parallel database systems, 
processor and memory allocation to operations, memory allocation to the 
buffers between operations in pipelines and data redistribution among 
processors. The cost model takes the waiting time of the operations in 
: ininc execution into consideration and is computable in bottom-up 

'■isnlcn. The query optimizer addresses the query optimization problem in 
■ /■ontext of Select-Project-Join queries that are widely used in 

■ r.::. a 1 DBMSs . Several heuristics determining the processor allocation 
' . . operations are derived and used in the query optimizer. The query 
optimizer is aware of memory resources in order to generate good-quality 
plans. It includes the heuristics for determining the memory allocation to 
operations and buffers between operations in pipelines so that the memory 
resource is fully exploit. In addition, multiple algorithms for 
implementing join operations are considered in the query optimizer. The 
query optimizer can make an optimal choice of join algorithm for each 

join operation in- a query. The proposed query optimization method has 
been used in a prototype parallel database management system designed 
-::;d imp i. erne n ted by the author. (11 Refs) 
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Abstract: The transitive closure operation is an important extension to 
relational algebra. Because of its high computational cost, it is of great 
interest to design efficient parallel algorithms for computing the 
transitive closure in relational database systems. In this paper, we 
present a new algorithm to compute transitive closures on SIMD meshes based 



on relational algebra operations. The double-hash distribution is used to 
avoid rehashing new tuples for the next join phase. There presently exists 
no extra step for the redistribution of these tuples . Possible 
redundant computation between different join phases has been prevented 
without using global operations. As only regular linear communication 
occurs on the mesh and the workload is fully distributed, a speedup of 
0(n*n) has been achieved, where n*n is the size of mesh. Therefore, this 
algorithm is an optimal parallel version of the transitive closure 
algorithms based on relational algebra operations on SIMD meshes. (28 
Ref s) 
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Abstract: When data is uniformly distributed, the parallel hash-based 
join algorithm scales up well. However, the presence of data skew can 
cause load imbalance among the processors, significantly deteriorating its 
performance. In this paper we propose a dynamic skew handling algorithm 
which deals with this load imbalance, by detecting and handling join 
product skews at run-time. The idea is to monitor the join processing at 
the join phase and compare the average processing rate of each partition 
with the rate statically predicted at the scheduling phase. If their 
difference is detected to be large enough to produce a significant 
performance degradation, the processor is considered to be overloaded and a 
workload compensation strategy is dynamically invoked. In this case, based 
on the measured average processing rate, _the _amo.unt of-overlead -caused by — 
— ti-h^— - a-npTBdi ct e - d _ j ol.n~product" s kew is calculated and, the amount of load to 
be migrated to the non-overloaded processors is determined. We propose two 
methods-the result redistribution and the processing task migration-to 
handle the load migration from the overloaded processor to the 
non-overloaded processors. Simulation results show that our dynamic skew 
handling approach can detect and handle load imbalances efficiently, so 
that the rebalance of load among the processors results in an almost 
constant join execution time under different join product skews. (11 Refs) 
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Abstract: Parallel database systems have to support both 

inter- t ransact ion as well as int ra-transaction parallelism. 

] r.ter- 1 ransact ion parallelism (multi-user mode) is required to achieve high 
; :-: ro'j'jjhpu t , in particular for OLTP transactions, and sufficient 
■t s •; -e i. feet i veness . I nt ra- 1 ransact ion parallelism is a prerequisite for 
■educing the response time of complex and data-intensive transactions 
(queries). In order to achieve both goals dynamic strategies for load 
balancing and scheduling are necessary which take the current system state 
into account for allocating transactions and subqueries to processors and 
for determining the degree of intra-transaction parallelism. We study the 
load balancing problem for parallel join processing in shared nothing 

database systems. In these systems, join processing is typically based 
on a dynamic redistribution of relations to join processors thus making 
dynamic load balancing strategies feasible. In particular, we study the 
performance of dynamic load balancing strategies for determining the number 
of join processors and for selection of the join processors. In 
contrast to previous studies on parallel join processing, we present a 
i-iulti-user performance analysis for both homogeneous and 

hnterogeneous/mixed workloads as well as for different database 

allocations. (34 Refs) 
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/■.:■.■':":■ ACT : This paper presents a dynamic load balancing algorithm for 

right-deep pipelined hash multi-joins executed on a shared nothing 
architecture. The algorithm makes use of a centralized processor called 
the foreman to gather statistics from the processors participating in 
the join. With these statistics, the foreman is able to determine how 
the build relation tuples should be redistributed to equalize the 
number of result tuples remaining to be generated by each of the 
processors. The unit of load migration is a hash line, the set of build 
tuples mapped to the same hash entry in a pipeline stage's hash table 
, and load balancing is performed separately for each stage of the 
pipeline, (author abst . ) 
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ABSTRACT: When data are uniformly distributed, _ ..parallel -hash-based join 
.__ algori thm- scares - up" well . "However , the presence of data skew can cause 
load unbalances among the processors, significantly deteriorating its 
performance. Within the last years, there has been a growing interest 
in addressing the problem of data skew. Many proposed algorithms add an 
extra sampling or scanning phase and a scheduling phase to the usual 
hash and join phases. Based on statistics of fine partitions of the 
relations Obtained in the sampling or scanning phase, they make 
estimations of the join costs in the scheduling phase, and attempt to 
balance the load across the multiple processors for the subsequent join 
phase. However, these algorithms still rely on simplistic assumptions 
of the data distribution within each fine partition. The optimality of 
the resulting load balance depends on the validity of these 



assumptions, which, in some cases, results in discrepancies with the 
real data and thus, in an unexpected join product skew. In this paper 
we propose a dynamic skew handling algorithm which deals with these 
load unbalances, by detecting and handling at run-time the join product 
skew, which could not be correctly predicted in the scheduling phase. 
When an overloaded processor is detected at run-time, the overload of 
producing the skewed output tuples (mainly writing the large number of 
result tuples) is dynamically migrated to other processors, while the 
reading of the building and probing partitions are stuck at the 
overloaded processor. We propose two new methods, the result 
redistribution and the processing task migration, to handle this 
•'vt) rload migration. Simulation results show that our dynamic skew 
handling approach can detect and handle load unbalances efficiently, so 

hac the rebalance of load among the processors results in an almost 
constant join execution time under different join product skews, 
(author abst . ) 
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We investigate various load balancing approaches for hash-based join 
techniques popular in multicomputer-based shared-nothing database 
systems. When the tuples are not uniformly distributed among the hash 
buckets, redistribution of these buckets among the processors is 
necessary to maintain good system performance. Two recent load balancing 
techniques which rely on sampling and incremental balancing, respectively, 
have been shown to be more robust than conventional methods. The comparison 
of these two approaches, however, has not been investigated. In this study, 
we improve these two schemes and implement them along with a conventional 
ir.ovhod and a standard join technique which does not do load balancing on 
nCUBE/2 parallel computer to compare their performance. Our 
--xperimencal results indicate that the sampling technique is the better 
approach. To further evaluate the performance of these techniques under 
diverse hardware conditions, we als o _devel op _a _cQst_mo del- and— implement— a~ 
- -s-imula-tior- -to - perf o r m" sens i t i"v ft y analyses with respect to various hardware 
parameters. The simulation results show that both sampling and incremental 
techniques provide noticeable savings over conventional methods, with the 
sampling approach being more scalable in supporting very large database 
sys terns . 



